CUDA Fortran for Scientists and Engineers by Massimiliano Fatica & Gregory Ruetsch
Author:Massimiliano Fatica & Gregory Ruetsch
Language: eng
Format: epub
ISBN: 9780124169722
Publisher: Elsevier Inc.
Published: 2013-09-15T16:00:00+00:00
3.5.2 Instruction-level parallelism
We have already seen an example of instruction-level parallelism in this book. In the transpose example of Section 3.4, a shared-memory tile of was used in most of the kernels. But because the maximum number of threads per block is 512 on certain devices, it is not possible to launch a kernel with threads per block. Instead, we have to use a thread block with fewer threads and have each thread process multiple elements. In the transpose case, blocks of threads were launched, with each thread processing four elements.
For the example in this section, we can modify the copy kernel to take advantage of instruction-level parallelism as follows:
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Coding Theory | Localization |
Logic | Object-Oriented Design |
Performance Optimization | Quality Control |
Reengineering | Robohelp |
Software Development | Software Reuse |
Structured Design | Testing |
Tools | UML |
Deep Learning with Python by François Chollet(12518)
Hello! Python by Anthony Briggs(9865)
OCA Java SE 8 Programmer I Certification Guide by Mala Gupta(9755)
The Mikado Method by Ola Ellnestam Daniel Brolund(9745)
Dependency Injection in .NET by Mark Seemann(9291)
Algorithms of the Intelligent Web by Haralambos Marmanis;Dmitry Babenko(8256)
Test-Driven iOS Development with Swift 4 by Dominik Hauser(7739)
Grails in Action by Glen Smith Peter Ledbrook(7665)
The Well-Grounded Java Developer by Benjamin J. Evans Martijn Verburg(7514)
Becoming a Dynamics 365 Finance and Supply Chain Solution Architect by Brent Dawson(6739)
Microservices with Go by Alexander Shuiskov(6503)
Practical Design Patterns for Java Developers by Miroslav Wengner(6403)
Test Automation Engineering Handbook by Manikandan Sambamurthy(6380)
Secrets of the JavaScript Ninja by John Resig Bear Bibeault(6376)
Angular Projects - Third Edition by Aristeidis Bampakos(5758)
The Art of Crafting User Stories by The Art of Crafting User Stories(5292)
NetSuite for Consultants - Second Edition by Peter Ries(5235)
Demystifying Cryptography with OpenSSL 3.0 by Alexei Khlebnikov(5052)
Kotlin in Action by Dmitry Jemerov(5017)
